Variable-rate speech coding: Replacing unvoiced excitations by linear prediction residues of different phonemes

نویسندگان

  • Wolfram Ehnert
  • Ulrich Heute
چکیده

Afin de réduire le débit binaire de la transmission de la parole sans perte de qualité de celle-ci, nous développons un vocodeur qui utilise des méthodes differentes pour le codage des trames voisées et non voisées. Nous présentons ici une nouvelle idée de décrire des phonèmes fricatifs (sifflantes) et plosifs avec seulement 20 bit par trame de t D 20ms. Nous montrons que ces phonèmes peuvent être représentés par des coefficients de la prédiction linéaire combinés avec un signal résiduel extrait d’un autre phonème prononcé par une personne differente connue à la station réceptrice du système de codage (voir figure 1). La présente contribution décrit aussi des algorithmes qui garantissent des transitions douces dans d’autres catégories de phonèmes. En appliquant cette technique on peut considérablement réduire le débit de transmission (jusqu’à 1 kbit/seconde) pour les trames non voisées. Nous obtenons de meilleurs résultats qu’en utilisant des variantes de CELP (prédiction linéaire excitée par une table de codage) à 4 kbit/seconde. La combinaison de ce codage avec des méthodes de codage harmonique (par exemple le MBE: ‘Multiband Excitation’) pour les trames voisées resulte en un débit binaire variable de moins de 3 kbit/seconde. ABSTRACT

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Design of a Variable Rate Algorithm for the CS-ACELP Coder

In 1995, 8 kb/s CS-ACELP coder of G.729 is standardized by ITU-T SG15 and it has been reported that the speech quality of G.729 is better than or equal to that of 32 kb/s ADPCM (G.726). However G.729 is the fixed rate speech coder, and it does not consider the property of voice activity in mutual conversation. If we use the voice activity, we can reduce the average bit rate in half without any ...

متن کامل

Variable-Rate CELP Based on Subband Flatness - Speech and Audio Processing, IEEE Transactions on

Code-excited linear prediction (CELP) is the predominant methodology for communications quality speech coding below 8 kbps, and several variable-rate CELP schemes have been discussed in the literature, including QCELP, the variable-rate wideband digital cellular mobile radio speech coding standard specified in IS-95. A key component of these speech coders is the detection and classification of ...

متن کامل

A High Quality Speech Coder at 600 bps

This paper presents a vocoder to obtain high quality synthetic speech at 600 bps. To reduce the bit rate, the algorithm is based on a sinusoidally excited linear prediction model which extracts few coding parameters, and three consecutive frames are grouped into a superframe and jointly vector quantization is used to obtain high coding efficiency. The inter-frame redundancy is exploited with di...

متن کامل

Strategies to improve the performance of very low bit rate speech coders and application to a variable rate 1.2 kb/s codec - Vision, Image and Signal Processing, IEE Proceedings-

This paper presents several strategies to improve the performance of very low bit rate speech coders and describes a speech codec that incorporates these strategies and operates at an average bit rate of 1.2 kb/s. The encoding algorithm is based on several improvements in a mixed multiband excitation (MMBE) linear predictive coding (LPC) structure. A switched-predictive vector quantiser techniq...

متن کامل

A Pattern Recognition Approach to Voiced — Unvoiced — Silence Classification with Applications to Speech Recognition

Absb-act—In speech analysis, the voiced-unvoiced decision is usually performed in conjunction with pitch analysis. The linking of voiced-unvoiced (V-UV) decision to pitch analysis not only results in unnecessary complexity, but makes it difficult to classify short speech segments which are less than a few pitch periods in duration. In this paper, we describe a pattern recognition approach for d...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 1997